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Abstract: Weak convergence of various general functionals of partial sums 
of dependent random variables to stochastic integral now play a major role in 
the modern statistics theory. In this paper, we obtain the weak convergence 
of various general functionals of partial sums of causal process by means of 
the method which was introduced in Jacod and Shiryaev (2003). 

Keyword: Weak convergence, causal process, stochastic integral. 

1 Introduction 

Weak convergence of stochastic processes is a very important and foun- 
dational theory in probability. In his classical textbook, Billingsley (1968) 
gave a systematic theory of weak convergence for stochastic processes. In 
the theory, finite dimensional distribution convergence and the tightness of 
stochastic processes are crucial. Partial sum processes of random variables 
and empirical processes are very important processes in the probability and 
statistics. We can establish the weak convergence of partial sum processes to 
Brownian motion and empirical processes to Brownian bridge by the classical 
method. 

With the quick development of modern statistics and econometric theory, the 
classical method has become difficult to deal with more complex processes. 
For example, sometimes we need to prove a convergence theorem about the 
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stochastic integrals. However, we can not get the weak convergence easily 
since it is difficult to compute the finite dimensional distributions of the 
stochastic integrals. The convergence theorem of stochastic integrals is a 
core theory in the unit root theory, which is a hot topic in the econometric 
theory, (c.f. Phillips (1987 a,b), (2007)). As mentioned in Ibragimov and 
Phillips (2008), the results of this type can be used in the study of transition 
behavior between regimes and marked intervention policy. However, earlier 
authors only obtained some results which describe the convergence to simple 
stochastic power integrals by the classical weak convergence theory, since it 
is complex to compute the finite dimensional distribution of more general 
stochastic integral, (c.f. De Jong and Davidson (2000 a, b)). 

In fact, when we intend to get a weak convergence result for a stochastic 
process sequence, we need to complete two tasks : one is to prove the rela- 
tive compactness of the stochastic process sequence, and the other one is to 
identify the limiting process. In the Billingsley's classical method, tightness 
is used to provide the relative compactness, while the convergence of finite 
dimension distributions identifies the limiting process. Jacod and Shiryaev 
(2003) developed a new approach to weak convergence of semimartingale 
sequences. Firstly, they introduced three characteristics to replace the three 
terms: the drift, variance of the Guassian part and Levy measure, which 
characterize the distribution of Levy process. By means of these three char- 
acteristics, one can show tightness of semimartingale sequence. Secondly, 
they characterized the law of limiting process as the unique solution of some 
martingale problem. In some special cases, the unique solution of a mar- 
tingale problem can be seen as a unique solution of stochastic differential 
equations (for example, the limiting process is a stochastic integral). Hence, 
Jacod and Shiryaev's method is a powerful tool to prove the limit theo- 
rem about the semimartingale. Because of some technical difficulty, this 
method is rarely used in the statistics and econometric theory. Ibragimov 
and Phillips (2008) used this method to obtain the weak convergence of var- 
ious general functionals of partial sums of linear processes. This type of 
results can be used in the unit root theory, where they deal with the func- 
tional of partial sum of linear processes as a semimartingal and employ the 
Beveridge-Nelson decomposition to compute three predictable characteris- 



2 



tics of the underlying semimartingale. 

In this paper, we extend these results to a causal process, which is an im- 
portant class of stationary processes. Wu (2005, 2007) developed a complete 
method and theory about the causal process. By means of the martingale 
approximation developed by Wu (2005, 2007), we obtain weak convergence 
of various general functionals of partial sums of causal processes. In fact, 
the martingale approximation is an extension of Beveridge-Nelson decompo- 
sition for a linear process. We will use martingale approximation twice to 
obtain our results. 

The remainder of this paper is organized as follows. Section 2 gives a short 
introduction of the martingale convergence method developed by Jacod and 
Shiryaev (2003). Section 3 gives some definitions and notations about a 
causal process. Section 4 presents our main result. The proof of the theorem 
will be collected in Section 5. Section 6 gives a simple application of our 
result to unit root autoregression theory. Some discussion about the further 
research is given in Section 7. 

2 Martingale Convergence Method 

2.1 Definitions 

In this subsection, we present some notations and preliminary results. 

LetM+ = [0,+oo)andZ = {--- , -2, -1, 0, 1, 2, • • • }. ((1, 3?, F = {^ t )t>i, P) 
is a filtered probability space. X is a semimartingale defined on (fi, = 
(&t)t>i,P)- Set h(x) = xl| x |<i, and 

X(h) t = J2 s < t [AX s -h(AX s )\, 
X(h) = X-X(h), 

X(h) is a special semimartingale and we consider its canonical decomposi- 
tion: 

X(h) = X + M(h) + B(h), (2.1) 
where M(h) is its local martingale part, B(h) is its finite variation part. 
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Definition 1 (Jacod and Shiryaev (2003) ) We call predictable characteris- 
tics of X the triplet (B, C, v) as follows: 

(1) B is a predictable finite variation process, namely the process B = B(h) 
appearing in (2.1). 

(2) C =< X C ,X C > is a continuous process, where X c is the continuous 
martingale part of X. 

(3) v is a predictable random measure on M.+ x R, namely the compensator 
of the random measure [i x associated to the jumps of X, \i x is defined by 

fi x {uj;dt,dx) = ^2l{AX,(u)^o}£(s,AX 3 )(dt,dx), 

s 

where e a denotes the Dirac measure at the point a. 

Remark 1 By means of the truncation function h(x), the semimartingale 
X can be divided into two parts: the jumps of one part are greater than 1, 
and the jumps of the other's are not. When a semimartingale's jumps are 
bounded, this semimartingale is a special semimartingale, in the other words, 
it has unique canonical decomposition, and hence we can get an unique B 
in Definition 1. If the semimartingale is a special semimartingale, it is not 
necessary to introduce the truncation function. In this paper, we discuss 
such semimartingale, and so do not introduce the truncation function. 

Remark 2 The predictable characteristics of semimartingale X are the 
counterpart of the drift, variance of Guassian part and Levy measure of 
independent increment process. By means of predictable characteristics, 
one can characterize the asymptotic properties of the semimartingale. 

If {Yfe, k > 0} is a discrete time semimartingale on probability space (fi, P), 
we can write 

k k k 

Yk = ^2vi = m + ^2 m i + ^2bi, 

i=0 i=l i=l 

where rj = Y , r) i = Y i - Y^, rrii = rji - Efaffi-i) and bi = Efaffi-i), 
i > 1. 

Set X s = Yj s j. From Definition 1, we can get the first and second predictable 
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characteristic of X s : 



B s = J>, °s = E^K 2 |^-i)- (2-2) 

i=0 i=0 

The third predictable characteristic of X s is a compensated random measure 
v. For a continuous function g in R, we have 




g(x)u(dx, dt) = E{g{rH)\&i-i). (2.3) 



2.2 Convergence of Semimartingales Using Predictable Char- 
acteristics 

Definition 2 (Jacod and Shiryaev (2003)) Let X be a cadlag process and let 
T~L be the a— field generated by X(0) and £o be the distribution of A(0). A 
solution to the martingale problem associated with ("H, X) and (Co, B, C, v) 
(denoted by s(a(Xo), X\£q, B, C, v)) is a probability measure P on (Q,^) 
such that X is a semimartingale on (fi, P) with predictable characteristics 
(B,C». 

The limit process X = (X(s)) s >o appearing in this paper is the canonical 
process X(s,a) = a(s) for the element a = (a(s)) s >o of £)(R + ). In other 
words, our limit process is defined on the canonical space (D(R + ), £F(R + ), D). 
For a > and an element (a(s),s > 0) of the Skorokhod space B(R + ), define 

S a (a) = inf(s : \a(s)\ > a or |a(s-)| > a), 

SZ = mi(s:\X n {s)\>a). 

In the paper, => denotes convergence in distribution in an appropriate metric 
P 

space, and — > denotes convergence in probability. The following theorem 
gives sufficient conditions for the weak convergence of a sequence of square- 
integrable semimartingales. This theorem provides the basis for the study 
of asymptotic properties of functionals of partial sums. 

Theorem A (Jacod and Shiryaev (2003)) Suppose that the following con- 
ditions hold: 
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(i) . The local strong majoration hypothesis: for all a > 0, there is an 
increasing continuous and deterministic function F(a) such that the stopped 
processes Var(B) sa , C sa and (\x 2 * z^l) 5 " 1 are strongly majorized by F(a). 

(ii) . The local condition on big jumps: for all a > 0, t > 0, 

lim sup|x 2 |l/| x | >b i *u tAS a(a) = 0. 

(iii) . Local uniqueness for the martingale problem q (a (Xq ), X|£o> B, C, v); 
We denote by Q the unique solution to this problem. 

(iv) . Continuity condition: for all t G D, g G C(R), the function a ~^ 
B t (a), C t (a),g*is t (a) are Skorokhod-continuous on D(i?), where D is a dense 
subset of R + . 

(v) . £q — >■ £o weakly as n — >■ oo. 

(vi) . 5 * i/» s „ - (5 * ^as«) o (X™) A for alU G £>, a > 0, g G C + (i?); 

sup s < t |B t n AS „ - (5 (AS «) o (X n )\ A for alU > 0, a > 0; 
C? AS a - (C tAS a) o (X n ) A for all t G D, a > 0; 
lim^oo limsup n P(\x 2 \l{\ x \ >b } * i^go > e) = for alii > 0, a > 0, 

e > 0. 

Then Jz^(X n ) => Q. 

2.3 Uniqueness Conditions for Homogenous Diffusion Pro- 
cesses 

The limiting process in Theorem A usually can be seen as a homogenous 
diffusion process. In this paper, the stochastic differential equation which is 
discussed has a homogenous diffusion process solution, we need a theorem to 
assure that this stochastic differential equation has a unique and measurable 
solution. 

Consider the stochastic differential equation: 

f dX 1 (t) = Xg(X 2 (t))dt + af(X 2 (t))dB(t), 

(2.4) 

dX 2 (t) = dB(t). 

A solution to (2.4) is a two-dimensional semimartingale X := (Xi,X 2 ) with 
the predictable characteristics B(X) and C(X), where, for an element a(s) = 



6 



(ai(s),a 2 (s)) in 



B(s,a) = ( g(a 2 (v))dv,0), 



C(s,a) 



f° f 2 (a 2 (v))dv J°f(a 2 (v))dv 
Jo f(a 2 (v))dv s 



Theorem B (Ibragimov and Phillips (2008) ) Suppose that 

(i) The functions f(x) and g(x) are locally Lipschitz continuous, i.e. for 
every N € N, there exists a constant Kn such that 

\f(x) - f(y)\ < K N \x - y\, \g(x) - g(y)\ < K N \x - y\ 

for all \x\ < N, \y\ < N. 

(ii) / and g satisfy the growth condition: there exists K > such that 

\f{x)\<e K \*\, \g(x)\<e K W. 

Then the stochastic differential equation (2.4) has a unique solution. In 
other words, the martingale problem <j(cr(Xo), X\£q, B, C, 0) has an unique 
solution. 



3 Causal Process and Martingale Approximation 

We call {X n , n > 1}, a causal process if X n has the form 

Xn = d{' j lj ^n)i 

where {e n ;n £ Z} is mean zero, independent and identically distributed 
random variables and g is a measurable function. Causal process is a very 
important example of stationary process. It has been widely used in practice, 
and contains many important statistical models, such as ARCH models, 
threshold AR (TAR) and so on. Asymptotic behavior of the sums of causal 
processes, S n = Y17=i are important subjects in both practice and theory. 
Recall that Z £ L' p (p > 0) if \\Z\\ p = [E(\Z\p )] x /p < oo and write \\Z\\ = 

\\z\\ 2 . 
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To study the asymptotic property of the sums of causal processes, martin- 
gale approximation is an effective method. Roughly speaking, martingale 
approximation is to find a martingale M n , such that the error || S n — M n \\ p 
is small in some sense. We list the notations used in the following part: 

• &k = (■ ■■ ,£k-i,£k)- 

• Projections V k Z = E{Z\& k ) - E(Z\,^ k -i), Z G L 1 . 

• D k = YZk PkX t , M k = Ylt =1 D t ,R k = S k -M k . 

• H k = J2Zi E (x k+ i\^ k ). 

• On,p = WPoX n \\ p „ A n>q = YJi=Q °i,qi n > °- Let ^n,p = = A„ iP if 71 < 0. 

• ©m,p = Yli=m @i,P ■ 

• B: standard Brownian motion. 

M k is a martingale, we will use M k to approximate sum S k . Throughout the 
paper, we assume that D k converges almost surely. 

Linear process is a very important example of causal processes, and many 
methods are developed to discuss it. Phillips and Solo (1992) studied the 
Beveridge-Nelson decomposition of linear processes, and then obtained some 
asymptotic results. This method is used to obtain the asymptotic results of 
short memory linear processes. 

Suppose that U n is the linear process U n = X^o a * e «-*- Appryrng the 
Beveridge-Nelson polynomial decomposition, one can get 



i=0 

where e n = Yl^o^n-i, a-i = YlT=i+i a k- 

In fact, the martingale approximation introduced above is an extension of 
the Beveridge-Nelson decomposition to causal processes. Applying the mar- 
tingale approximate to U n , we get 




(3.1) 



oo oo oo 



^2 P kUi = a *) £fc ' Uk ~ PkUi = £k ~ 1 ~ £k ' 



i=k i=0 i=k 



From Wu (2005), we have 




(3.2) 
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where e' is the independent copy of £q- 



4 Main Result 

Assumption 1. X G C q , q > 2, and 6 Ttj(? * = 0(n 1/9 *~ 1/2 (log n)" 1 ), where 
q* = min(g, 4). 

Assumption 2. 

oo 

Y,\\E(Dt\^o)-v 2 \\ q */2<°o, 
k=l 

where \\Dk\ \ = a. 
Assumption 3. 

oo oo 

Y,Y1 \\E(X k X k+t \^ ) - EiXkXk+^^U < oo, 

k=0 i=l 

and 

oo oo 

||^^£(X fc X fc+i |^o)||3<°0. 
k=0 i=l 

Theorem 1 Let / : R — > R be a twice continuously differentiable function 
such that /' satisfies < i^T(l + |a;| a ) for some positive constants K and 

a and all x G R. Suppose that X t is a causal process satisfies Assumptions 
1~3. Then 

[77,7*1 £ ]_ 

-i^/(i^I,)I t ^ f f'(B(v))dv + a ^ f(B(v))dB(v), (4.1) 
where A = Y%Li EX o x j- 

Remark 1 The assumptions on the function / in this paper is the same as 
that of Ibragimov and Phillips (2008) . Assumption 1 on the causal process is 
more wild than Ibragimov and Phillips (2008) 's. In their paper, they assume 
Yli^i *l a «l < °°> while our condition is weaker from (3.2). 

Remark 2 Assumption 3 is not strong as well. We have 

00 00 00 00 00 

||^^E(A fe A fe+i |^o)||3<0(EE E 1 

a j I \ a j+r I)- 

fc=0 i=l r=0 k=0 j=k+l 

From Lemma E5 of Ibragimov and Phillips (2008), Yli^i ^\ a i\ < 00 implies 

EOO V— VOO V— VOO I I |~ I , 

r=0 Lit=0 Lj=fc+1 \ a j\\ a j+r\ < 00 • 
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5 The Proof of Main Result 



Firstly, we introduce two lemmas. 
Lemma 1 (Wu (2007) ) Assume that E[X ] = 0, X G C q , q > 1, let 
q' = min(2, q), and 9 ,g = Ya^o^^i < 00 ' tnen 

II max|S fc | ||,< *BL n WQ 

k<n q — 1 

where 5, = 18g 3 / 2 (g - l)" 1 / 2 if g € (1, 2) U (2, oo) and B q = 1 if q = 2. 

Lemma 2 (Wu (2007) ) Under Assumptions 1 and 2, there exists a standard 
Brownian motion B on a richer probability space such that 

|5 n - B(a 2 n)\ = O a . s .(n 1 / 4 (logn) 1 / 2 (loglogn) 1 / 4 ). 



Set 



]ns] t-1 [ns] 



to) 



and 



X( S ) = (A [' f'(B(v))dv + * f f(B(v))dB(v),B(s)) =: (X 1 (s) , X 2 (s)) . 
Jo Jo 

By (2.2) and (2.3), we can get the first two predictable characteristics of X n 
as follows: 



[ns] t-1 [ns] 



)), 



C n (s) 



Cn 1 ( s ) Cn 2 ( s ) 

t-i 



c «( s )%E/ 2 (iEw 2 i^). 

" t=2 Vn i=l 
[ns] 

c 22 ( s ) = i^E(A 2 |^-i), 
n t=i 

^ 2 ( S ) = c 2i (.) = -E/(^E^(a 2 i^-i)- 

71 t=2 Vn i=l 
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The process X(s) = (X 1 (s) , X 2 (s)) , an element of the Skorokhod space 
D(R+), is a solution to the stochastic differential process 



To prove the main result, we need to verify the conditions in Theorem A. 

By the similar argument in Ibragimov and Phillips (2008) (3.15-3.17), we 
can get that condition (i) in Theorem A is satisfied. 

Since the limiting process is continuous, condition (ii) and condition (iv) in 
Theorem A don't need to be verified. 

Under the assumptions of the theorem, function f(x) and f'(x) are locally 
Lipschitz continuous and satisfy growth condition. From Theorem B, the 
stochastic differential equation has a unique solution. In other words, the 
martingale problem ?(<t(Ao), X\Co, B, C, v) have unique solution. We can 
get that condition (iii) in Theorem A is satisfied. 

Since Cq = 0, Cq = 0, it suffices to check condition (vi) in Theorem A. 
From Jacod and Shiryaev (2003), if we can show 



< 



dX 1 ^) = \f'(X 2 (t))dt + af(X 2 (t))dB(t), 
dX 2 {t) = dB(t). 



(5.1) 



The predictable characteristics of X are (B(X), C(X), 0): 





0-2 lo f 2 (. a 2(v))dv a J Q S f(a 2 (v))dv 
a Jq f(a 2 {v))dv a 2 s 




(5.2) 



0<s<AT 



then g * i/^ s „ - {g * u tASa ) o (X n ) A for alii G D, a > 0, g € C+(R). 



Secondly, we need to compute the terms of B(s) o X n , C(s) o X,, 
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We divide the proof into three steps: 
Step 1 We prove 

sup \C^(s)-C^(s)oX n \ Ao, (5.3) 

0<s<JV 

where iV is a positive integer. Since the proofs for i, j = 1, 2 are similar, we 
only consider the case of % = j = 1. In fact we need to show that 



su p I^E/ 2 (^E^)(^(A 2 l^-i)-^ 2 )-^/ 2 (^E^)(" s -H)l Ao, 

(5.4) 

Firstly, we prove 



[ns] t—1 [ns] t—1 

1 M 1 t-i 1 M ., t-i 1 t-i -. t-i 

-p i i E/ 2 (^E^)- 1 E/ 2 (^EA)i< 1 ma X j/ 2 (^E^)-/ 2 (^Ea)i, 



t-l - [nsj -, t-1 

0<s<N 

Since 

t-1 , [nsj , t-l t-l _, t-l 

0<s<Af 

and / is a uniform continuous function, we can get (5.5) by 

j t-i ^ t-i 

max |— VjSQ-— VA|A0. (5.6) 

v «=1 v «=1 

For any e > 0, by Lemma 1, for 2 < q < 4, we have 

p { ' m „ iy>-y>i>e)< *eS(*-".>p <c .%.)- 

>i<KnAf IZj ^ 1 7 - ne 2 - ne 2 

i=i «=i 

which implies (5.6), then we obtain (5.5). 

By the martingale version of the Skorokhod representation theorem, on a 
richer probability space, there exist a standard Brownian motion B and 
nonnegative random variables Ti,T2,--- with partial sums T k = Yli=i r « 
such that for k > 1, T k - ka 2 = o a . s .(/c 2 / g ) and M fc = B(T fc ), E{T k \& k - X ) = 
E(p\\$ k _ x ). (cf. Wu (2007)). 
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For < s < we consider 

t-i 



2nW = -E / 2 (4= E a)* 2 + -/ 2 (4= E a 



){ns — [ns]). 



By the martingale version of the Skorokhod representation theorem, we have 
X n {s) = E f 2 ( B ^^T + ^fW^Mns - [ns]). 

lb lb lb lb 

t=2 

Since T k - ka 2 = o a . s .(& 2/9 ), 

max|B(— )-£(—)! < max sup \B(-) - B(— )| 

t<k n n t<k \ x _ a 2 A < k 2, q n n 

< O^Xk^y^gk). 

By the uniform continuous property of / and the similar argument in (5.5), 
we have 

sup |X n ( S )-EV 2 (^(^^))-+-/ 2 (S(^ :i ^))(n S -H)| Ao. 
0<s<N ~ n n n n 

By the Riemann approximation of stochastic integral, and the continuity of 
Brownian motion's paths, we have 



0<s<N 
1 WH f2( 1 v^t-l 



sup \l n {s) - [ S f 2 (B(v))dv\ A 0. (5.7) 

<s<N Jo 



For iEiri/ 2 (^E'=i^)^(A 2 |^-i), and by noting M fc = B(T k ), we 
have 

, M -, t-i , M , 

sup |-^/ 2 (-=^x 4 )i?(A 2 l^-i)--E/ 2 (^ i? ( r *-i)) i:; (^ 2 l^- 

-, t-1 , [ns] 

= su p i-E/ 2 (^E^) i? ( r '- T '-ii^-i)--E/ 2 (^( r *-i)) £; ( T *- r *-ii^-i)i ( 5 - 8 ) 

o<s<N nf^ Vnfr! n 7^2 



)\ 



A 0. 



by Lemma 2. 

By the Riemann approximation of stochastic integral and the Approximated Lapla- 
cians property (cf. Dellacherie and Meyer (1982)), we obtain 

sup |-V/ 2 (— B^OTO-Tt.!^.!)- / f 2 (B(v))dv\ A 0, 
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and combining with (5.8), we have 

sup ^^(Tt-T^I^.O- / / 2 (B(v))dv| A 0. (5.9) 

By (5.7) and (5.9), we obtain (5.3). 
Step 2 We prove 

sup \B n {s) - B(s) o X n \ A 0, (5.10) 

0<s<JV 

which will be proved if we show 

-, H -, t-i > H -, t-i 

^TnW ( T"^ X ' KX ' " A) ~ "§ " ° <5 ' n) 

We have 

* : ^ l ^S /( ^S w '- A, ^S / ' < ^S x * )l 

v t=2 v i=l t=2 v i=l 

- i-7= E(/(-7= E *) - E ^))^ - 4/(4 E - - E /'(4 E *<)i> 

and 

max Jfe < max |-L/(-L VX;)/^ + max |- V /'(-L V Xi){X t H t - A)| 

l<k<nN l<k<nN'^/n \fh ^ l<k<nN n ^ y/n ^ 

V t — 2 V 2 — 1 V 2 — 1 V 2 — 1 V 

We firstly prove 

^I^E^E^A-AJIAO. (5.12) 

_ _ t=2 v i=l 

Set F tJ = £(X t X t+j |J? t ) - £(X t X t+j ), we prove 

1 fe 1 t— 1 CO 

^j-E/^E^E^oi-o- (5-i3) 

- - t=2 v i=l j=l 

We approximate S< := X^li by £>t := X)At 'Pt(Sk), then we need to prove 

1 fe . t-i 

max |-V /'(-=VXi)5 t | Ao (5.14) 
Kk<nN n ^ Jn ^ 
~ ~ t=2 v i=l 
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and 

t-i 



^E/'^E^KSt-A)! AO. (5.15) 

12 t=2 V n i=l 

For (5.14), we have 



max 

l<fe<nJV 1 n 



t-i 



e (/'(4E^)A) 2 < * £(/'(-^=X>) 4 )£(A) 4 , 

* 2—1 \ * i=l 

oo oo 

[S(A) 4 ] 1/4 = [25(E W*)) 4 ] 1/4 ^ E ll^(S fc )IU. 
fc=i fe=t 

However, by Assumption 3, we have 

oo oo oo 

E Il p t( s fc)ll4 - E E l|E(*fc*fc+i|-£i) - EiXkXk+il^,)^ < oo. 

fc= t fc=f 2—1 

Since /'(x) < C(l + |x| Q ), we have 

W-^E X *) 5 *) 2 <i (5-16) 

by Lemma 1. Then, by Kolmogorov inequality for martingale, for any e > we 
have 



< 



1 fe . t-i 
P( max |-V f(-=Y" Xj)5 t | >e) 

_ _ t=2 v i=l 



Nmax 1<t<Nn E[f'(±Y,]ZiX i )D t ]* i 
< 5^ < C- -> 0. 



(5.14) is proved. 
For (5.15), we have 



oo oo 



S t -D t = Z t _ 1 - Z u Z t = J2J2 E ( x t+i+kX t+ i\^t), (5.17) 

i=l fe=l 

and 

^ £ ^ ^ t— 1 

i<& ^ E /'(4r E - A)| = . max.. |± ^ /'( J= E - - i ; 



Y f'(^YXi)(St- D t )\ = max | - V /'(-= V Xi)(Z t 

i=2 v i=l ^ ^ t=2 v i=l 

^ E + ^ £ E(/'(^ E *) - /'(^ E 

v z— 1 i— 2 v z— 1 v i—1 
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/ \tH 1 V M \^k\ N\X k Z k \ „ 

< max / (— = > Xi)\ max h max = — sup f (t). 

l<k<nN \/n Z — 1 l<k<nN n l<k<nN y/n E fc_ x . 

1-1 ' l*l<max < fc <„jv ' 

Under Assumption 3, by law of large number, we have 

1 p _ 1 p 

max n~ 6 \Xk\ — > 0, max n 3 \Zk\ — > 0, 

l<fe<niV l<fc<nAT 

so 

1 p 

max -^|A fc Z fe | — ■> 0. 

l<fc<nJV y'n 

From Lemma 2, we have ^= S»=i = ^p(-0i by the continuity of f"(x), we 
obtain (5.15). 

We have, by Talor expansion, that 

V t — 2 V 2—1 V l — l V Z— 1 v 

< (N/2) max ^=X%\H k \ sup /"(t) 

l<k<nN y/n £ fe x . 

|t|<<max < fc <„jv - 

Under Assumption 3, by law of large number and similar argument in the above, 
we get that 



k t t— 1 t— 1 

max i4=y\f(4=y^i)-/(4=^*i) - f(^y"^)4^i 

T, — ^ % — X % — 1 % — 1 



Then we can easily get (5.10). 
Step 3 

We prove (5.2) and 

hmlimsupP(|x 2 |l {N>b} * v™ hSa > e) = (5.18) 

ojoo n 

for alU e D, a > 0. 

For (5.2), similar to Ibragimov and Phillips (2008), 

1 . fc „ 1 1 

sup \AX n (s)\< max \f(-='y]X i )\ ■ max -=\X k \ + max -i=\X k \. 

Q <S <N 0<k<nN y/n 0<k<nN y/n 0<k<nN y/n 

(5.19) 

From Lemma 2, wc have ^= X^i=i -^i = Op(l). Combining with the assumptions 
of f{x), we have (5.2) by (5.19). 

As for (5.18), 



E / \x 2 \l(\ x \ >b) v n {dt,dx) 

Jo Jm. 2 
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< E 



[ [ \x 2 \l(\ x \>b)Vn(dt,dx) < -^E I [ \x 4 \v n (dt,dx) 

JO JM 2 JO JM 2 

< [ [ \x\ + x\\v n {dt,dx). 

Jo Jm 2 

[ns] t-1 , [ns] 

* — A 1 — J. t — _■ 



By uniform continuity of f(x) and -±= Yn=i X i = °p{ 1 ), wc havc / (7= Ei=i x %) = 
Op(l). Furthermore, 



and 



we obtain (5.18). 



, [ns] t-1 
t=2 v i=l 

[ns] 



n 

t=2 



Combining the three steps, we obtain the condition (vi) in Theorem A, then we get 
our result. 



6 Application to Unit Root Autoregression 

In this section, we use our main result to obtain the limit theorem for unit root 
autoregression. The theory of unit root autoregression is a hot topic in modern 
time series. Let 

Y t = aY t _ 1 +X t , (6.1) 
where X t is a causal process with the form 

where e„, n € Z are mean zero, independent and identically distributed random 
variables and g is a measurable function. 

If a = 1, we want to estimate a from {Y t } . Let 

a _ S? = i*-i* 

denote the ordinary least squares estimator of a. 
Let t a be the regression t— statistic with a = 1: 

, (E^n 2 -i)'(«-i) 
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In the following theorem, we get the asymptotic distribution of n(a — 1) and t a . 
Theorem 2 Under Assumptions 1-3, we have 

(/ 'sn»)*)S 

Proof. n 

n(&-l)= "^^ lJCt . (6.4) 

Similarly to the proof of Theorem 1 , we have 

(- >T y t _! X t , ^ ]T ( f A + °* f B{v)dB{v)). (6.5) 

n t=i n t=i - 70 7o 

By continuous mapping theorem, we get (6.2) and (6.3). 



7 Discussion 

In this paper, we study the weak convergence of various general functional 
of partial sums of causal processes. But we only consider the univariate case. In 
Ibragimov and Phillips (2008), they also considered the multivariate case. However, 
we can not get the multivariate extensions by our method. For the univariate case, 
we can use the Skorokhod embedding argument to get the asymptotic results of 
second predictable characteristics of semimartingale. But for multivariate case, we 
can not find a unique stopping time to embed into every component of multivariate 
Brownian motion, so we can not obtain the correspondence results in accordance 
with the method of proof of this paper. 

There are mainly two methods to deal with the asymptotic results of causal pro- 
cesses. One is the martingale approximation developed by Wu (2007). This method 
actually is an extension of Bcvcridge-Nelson decomposition in Phillips and Solo 
(1992). The other one is m-dependent approximation developed by Liu and Lin 
(2009). By the m-dependent approximation, Liu and Lin (2009) get the strong 
invariance principle for d-dimcnsional causal process. The multivariate extension 
may be obtained by the m-dependent approximation in the future research. Wc 
take this extension as a conjecture. 

Conjecture Let / : R — > R be a twice continuously differentiable function such 
that /' satisfying |/'(x)| < K(l + \x\ a ) for some positive constants K and a and 
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all x G R. Suppose that X t = (X^,Xf) is a 2-dimension causal process satisfying 
Assumptions 1~3, Then 
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